Documents authored by Siklósi, Borbála

Combining Language Independent Part-of-Speech Tagging Tools

Authors: György Orosz, László János Laki, Attila Novák, and Borbála Siklósi

Published in: OASIcs, Volume 29, 2nd Symposium on Languages, Applications and Technologies (2013)

Part-of-speech tagging is a fundamental task of natural language processing. For languages with a very rich agglutinating morphology, generic PoS tagging algorithms do not yield very high accuracy due to data sparseness issues. Though integrating a morphological analyzer can efficiently solve this problem, this is a resource-intensive solution. In this paper we show a method of combining language independent statistical solutions for PoS tagging -- including a statistical machine translation tool -- to effectively boost tagging accuracy. Our experiments show that, using the same training set, our combination of language independent tools yields an accuracy that approaches that of a language dependent system with an integrated morphological analyzer.

Cite as

György Orosz, László János Laki, Attila Novák, and Borbála Siklósi. Combining Language Independent Part-of-Speech Tagging Tools. In 2nd Symposium on Languages, Applications and Technologies. Open Access Series in Informatics (OASIcs), Volume 29, pp. 249-257, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2013)

  author =	{Orosz, Gy\"{o}rgy and Laki, L\'{a}szl\'{o} J\'{a}nos and Nov\'{a}k, Attila and Sikl\'{o}si, Borb\'{a}la},
  title =	{{Combining Language Independent Part-of-Speech Tagging Tools}},
  booktitle =	{2nd Symposium on Languages, Applications and Technologies},
  pages =	{249--257},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-939897-52-1},
  ISSN =	{2190-6807},
  year =	{2013},
  volume =	{29},
  editor =	{Leal, Jos\'{e} Paulo and Rocha, Ricardo and Sim\~{o}es, Alberto},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{},
  URN =		{urn:nbn:de:0030-drops-40441},
  doi =		{10.4230/OASIcs.SLATE.2013.249},
  annote =	{Keywords: part-of-speech tagging, combination, agglutinative languages, machine learning, machine translation}